Kanawha County
Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
He, Qi, Qian, Cheng, Chen, Xiusi, He, Bingxiang, Fung, Yi R., Ji, Heng
Claim verification with large language models (LLMs) has recently attracted growing attention, due to their strong reasoning capabilities and transparent verification processes compared to traditional answer-only judgments. However, existing approaches to online claim verification, which requires iterative evidence retrieval and reasoning, still mainly rely on prompt engineering or pre-designed reasoning workflows, without unified training to improve necessary skills. Therefore, we introduce Veri-R1, an online reinforcement learning (RL) framework that enables an LLM to interact with a search engine and to receive reward signals that explicitly shape its planning, retrieval, and reasoning behaviors. This dynamic interaction of LLM with retrieval systems more accurately reflects real-world verification scenarios and fosters comprehensive verification skills. Empirical results show that Veri-R1 improves joint accuracy by up to 30% and doubles the evidence score, often surpassing its larger-scale model counterparts. Ablation studies further reveal the impact of reward components, and the link between output logits and label accuracy. Our results highlight the effectiveness of online RL for precise and faithful claim verification, providing an important foundation for future research. We release our code to support community progress in LLM empowered claim verification.
- North America > United States > West Virginia > Kanawha County > Charleston (0.14)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > Illinois > Champaign County > Urbana (0.04)
- Asia > China > Hong Kong (0.04)
- Research Report > New Finding (0.86)
- Instructional Material > Online (0.61)
- Leisure & Entertainment > Sports (0.46)
- Law (0.46)
- Government (0.46)
Online Sparsification of Bipartite-Like Clusters in Graphs
Das, Joyentanuj, De, Suranjan, Sun, He
Graph clustering is an important algorithmic technique for analysing massive graphs, and has been widely applied in many research fields of data science. While the objective of most graph clustering algorithms is to find a vertex set of low conductance, a sequence of recent studies highlights the importance of the inter-connection between vertex sets when analysing real-world datasets. Following this line of research, in this work we study bipartite-like clusters and present efficient and online sparsification algorithms that find such clusters in both undirected graphs and directed ones. We conduct experimental studies on both synthetic and real-world datasets, and show that our algorithms significantly speedup the running time of existing clustering algorithms while preserving their effectiveness.
- North America > United States > West Virginia > Kanawha County (0.04)
- North America > United States > Virginia (0.04)
- North America > United States > Arizona > Maricopa County (0.04)
- (2 more...)
WavePulse: Real-time Content Analytics of Radio Livestreams
Mittal, Govind, Gupta, Sarthak, Wagle, Shruti, Chopra, Chirag, DeMattee, Anthony J, Memon, Nasir, Ahamad, Mustaque, Hegde, Chinmay
Radio remains a pervasive medium for mass information dissemination, with AM/FM stations reaching more Americans than either smartphone-based social networking or live television. Increasingly, radio broadcasts are also streamed online and accessed over the Internet. We present WavePulse, a framework that records, documents, and analyzes radio content in real-time. While our framework is generally applicable, we showcase the efficacy of WavePulse in a collaborative project with a team of political scientists focusing on the 2024 Presidential Elections. We use WavePulse to monitor livestreams of 396 news radio stations over a period of three months, processing close to 500,000 hours of audio streams. These streams were converted into time-stamped, diarized transcripts and analyzed to track answer key political science questions at both the national and state levels. Our analysis revealed how local issues interacted with national trends, providing insights into information flow. Our results demonstrate WavePulse's efficacy in capturing and analyzing content from radio livestreams sourced from the Web. Code and dataset can be accessed at \url{https://wave-pulse.io}.
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- North America > United States > New York > Kings County > New York City (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (215 more...)
- Media > Radio (1.00)
- Leisure & Entertainment (1.00)
- Government > Voting & Elections (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models
Li, Loka, Chen, Zhenhao, Chen, Guangyi, Zhang, Yixuan, Su, Yusheng, Xing, Eric, Zhang, Kun
The recent success of Large Language Models (LLMs) has catalyzed an increasing interest in their self-correction capabilities. This paper presents a comprehensive investigation into the intrinsic self-correction of LLMs, attempting to address the ongoing debate about its feasibility. Our research has identified an important latent factor - the "confidence" of LLMs - during the self-correction process. Overlooking this factor may cause the models to over-criticize themselves, resulting in unreliable conclusions regarding the efficacy of self-correction. We have experimentally observed that LLMs possess the capability to understand the "confidence" in their own responses. It motivates us to develop an "If-or-Else" (IoE) prompting framework, designed to guide LLMs in assessing their own "confidence", facilitating intrinsic self-corrections. We conduct extensive experiments and demonstrate that our IoE-based Prompt can achieve a consistent improvement regarding the accuracy of self-corrected responses over the initial answers. Our study not only sheds light on the underlying factors affecting self-correction in LLMs, but also introduces a practical framework that utilizes the IoE prompting principle to efficiently improve self-correction capabilities with "confidence". The code is available at https://github.com/MBZUAI-CLeaR/IoE-Prompting.git.
- Europe > Norway (0.14)
- North America > United States > California (0.14)
- North America > Canada > British Columbia (0.14)
- (35 more...)
Bot or Human? Detecting ChatGPT Imposters with A Single Question
Wang, Hong, Luo, Xuan, Wang, Weizhi, Yan, Xifeng
Large language models like ChatGPT have recently demonstrated impressive capabilities in natural language understanding and generation, enabling various applications including translation, essay writing, and chit-chatting. However, there is a concern that they can be misused for malicious purposes, such as fraud or denial-of-service attacks. Therefore, it is crucial to develop methods for detecting whether the party involved in a conversation is a bot or a human. In this paper, we propose a framework named FLAIR, Finding Large language model Authenticity via a single Inquiry and Response, to detect conversational bots in an online manner. Specifically, we target a single question scenario that can effectively differentiate human users from bots. The questions are divided into two categories: those that are easy for humans but difficult for bots (e.g., counting, substitution, positioning, noise filtering, and ASCII art), and those that are easy for bots but difficult for humans (e.g., memorization and computation). Our approach shows different strengths of these questions in their effectiveness, providing a new way for online service providers to protect themselves against nefarious activities and ensure that they are serving real users. We open-sourced our dataset on https://github.com/hongwang600/FLAIR and welcome contributions from the community to enrich such detection datasets.
- North America > United States > Wyoming > Laramie County > Cheyenne (0.04)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- North America > United States > West Virginia > Kanawha County > Charleston (0.04)
- (5 more...)
News at a glance
SCI COMMUN### Conservation A company seeking to build a controversial gold and copper mine in Alaska won a major victory on 24 July when the U.S. Army Corps of Engineers issued an environmental analysis saying the mine wouldn't endanger the world's most productive sockeye salmon fishery. The decision clears the way for the Corps to issue permits needed by promoters of the Pebble Mine, located at the headwaters of two major watersheds that form part of the Bristol Bay salmon runs, just north of the Aleutian Islands. Environmental and Native Alaskan groups and some salmon scientists blasted the new study, saying it understated risks by focusing on the mine's small, initial footprint over 20 years of mining rather than its potential impacts if it expands to become one of the world's largest gold and copper mines, as its promoters hope. Mine backers have said such an expansion would get a closer environmental review later if they pursue it. Scientists have raised concerns that even the smaller mine could have wide impacts, because the resilience of the salmon runs hinges on access to a wide variety of spawning habitats. Environmental groups have vowed to file lawsuits to block the project. 90% —Accuracy of a new artificial intelligence system trained to identify individual weaver birds, which human birders generally cannot tell apart unless they are tagged ( Methods in Ecology and Evolution ). ### Planetary science China's first independent mission to Mars blasted off from the Xichang Satellite Launch Center on 23 July. To arrive in February 2021, Tianwen-1, a “quest for heavenly truth,” comprises an orbiter, lander, and rover. Only the United States and the Soviet Union have successfully landed on Mars. Instruments on the three Tianwen-1 craft will study the planet's magnetic field and atmosphere, map its surface, and characterize its geology. Tianwen-1 is the second in a trio of fresh martian missions: The United Arab Emirates launched its Hope orbiter on 19 July, and NASA planned to launch its Perseverance rover as early as 30 July, after Science went to press. ### Funding A bill in France would increase research spending over the next 10 years and add tenure-track faculty positions, a novelty in France. But critics say the plan's increases would be too small and slow. By 2030, the annual public research budget would rise by about one-third, to €20 billion, toward a goal of lifting overall R&D spending from 2.2% of gross domestic product to 3%. The National Research Agency, which funds researchers through competitive calls, would get €1 billion more over 7 years, reaching about €1.7 billion in 2027, to help raise its grant success rates from 16% to a target of 30%. The new, nonpermanent tenure-track positions would complement the permanent entry-level research positions traditionally offered by the French system, but critics fear the growth may lead to a decline in the permanent ones. Parliament is expected to approve the bill. ### Drug trials A monoclonal antibody given to babies has strongly protected them from severe disease caused by respiratory syncytial virus (RSV), a leading cause of infant death. As reported this week in The New England Journal of Medicine , a placebo-controlled study of nearly 1500 babies born preterm—who are at higher risk of severe symptoms of RSV—in 23 countries found that a single injection of the antibody before RSV season starts in the fall led to 78.4% fewer hospitalizations for lower respiratory infections associated with the disease. The antibody, being developed by AstraZeneca and Sanofi Pasteur, could replace one now on the market that is rarely used. (It is recommended only for infants at highest risk, requires five shots, and is very expensive.) The companies plan to seek regulatory approval of the new prophylaxis if larger studies now underway in preterm and full-term infants confirm that it is safe and effective. ### Graduate studies The American Astronomical Society last week launched the Astronomy Genealogy Project, which maps 5000 astronomers to their academic “descendants”—the 28,000 doctorate recipients they supervised. The discipline's family tree, at astrogen.aas.org, stretches back to 1766, but half of the listed doctorates were awarded since 2002. Organizers hope the data will help historians and sociologists of science analyze patterns across countries, universities, and subfields. U.S. universities awarded slightly more than half of the doctorates listed, and about two-thirds of the theses are online. ### Climate Environmental groups last week denounced as weak a plan announced by the U.S. Environmental Protection Agency (EPA) to limit greenhouse gas emissions from aircraft. The new standard would match an existing one adopted in 2016 by a U.N. body, the International Civil Aviation Organization (ICAO), that required emissions cuts by 2028. But recently manufactured planes already meet the standard, and EPA conceded its new rule would not reduce overall airplane emissions. Manufacturers have supported such a U.S. regulation to help them meet ICAO certification requirements. IACO has predicted that even under its standard, airplane emissions will grow by at least 3% a year globally. U.S. aviation accounts for 3% of the country's greenhouse gas emissions. ### Extreme life Bacteria from seafloor sediments buried 101 million years ago have been grown in the lab, raising the possibility they are as old as their muddy home. They had somehow survived in an area of the Pacific Ocean almost devoid of organic matter or other nutrients most bacteria need, although the sediments recovered do contain oxygen, the researchers report in Nature Communications . The finding pushes back the documented age of bacteria living in marine sediment from 15 million years and provides new insights on the limits of life under extreme conditions. A team led by researchers from the Japan Agency for Marine-Earth Science and Technology harvested the microbes from core samples drilled up to 5700 meters below sea level and took precautions against contaminating them with modern bacteria. The group argues the microbes likely didn't have enough food to keep replicating, and instead may have survived for eons without dividing by repairing age-related cellular damage. The microbes identified are known members of more than eight bacterial groups, many of which are commonly found elsewhere on Earth. ### Biotechnology Scientists announced last week that they used CRISPR gene editing to modify a cow embryo so that the resulting calf, named Cosmo, should produce more offspring bearing male traits. Bulls are 15% more efficient than cows at converting feed into weight gain, so the new method may allow cattle farmers to raise fewer cattle, benefiting the environment, say the researchers at the University of California, Davis. The researchers inserted a gene called SRY , which initiates male development and is normally found on the male sex chromosome, into an embryo's chromosome 17. Next, the researchers plan to determine whether Cosmo's offspring that inherit the SRY gene look and grow like males. Fifty percent of the calf's progeny will naturally be male; another 25% will be genetically female but will carry the SRY gene. ### Conservation Florida's governor this month signed a bill to establish a 162,000-hectare marine sanctuary in the Gulf of Mexico and protect one of the state's last remaining stretches of seagrass. Florida's coast boasts the most continuous expanse of seagrass beds in the United States, but these diverse habitats, home to blue crabs and manatees, have been damaged by nutrient-driven algal blooms and boat propellers. Authorities plan to create a management plan for the new Nature Coast Aquatic Preserve to balance protection with ecotourism, boating, and fishing. ### A magic ride for science In 1984, artist Bruce Degen met writer Joanna Cole at a publisher's office in New York City to discuss creating a children's book about science. They went on to collaborate and publish 13 colorful, zany books in The Magic School Bus series, featuring the ebullient, intrepid teacher Ms. Frizzle (above, right), who takes her students on fantastic adventures into the ocean, across the Solar System, and through the human body, for example. Cole died on 12 July at age 75. But the series continues to teach young readers and their parents about the natural world. > Q: Did you expect to create such a legacy? > A: I was in art school doing very serious art, and I realized that, in my heart of hearts, I wanted to do children's books. In the beginning, it was darn hard work. Some book sketch dummies have five layers of rewrites and reillustrations. The first book was a one-book contract to see if this would work. [The reception] was like the world was waiting for somebody to make this happen. People say, “[As a child,] I [used to] read these books, now I read them to my kids!” I could never have imagined it. > Q: Does scientific accuracy get in the way of storytelling? > A: Frequently. You have to tell kids what is true, but you can't give them all the truth—it's too much. For example, the evolution book goes from now [back] to the beginning of the Earth. I [initially] tried to show every era, year, and life form. It was too complicated. So it ended up as a nice, open spiral with a few representations of each era. > Q: Why use the format of adventures? > A: By following the story, it gave kids a mental filing system—they could retrieve and remember information because it was given to them in a memorable trip. ### Dispatches from the pandemic Read additional Science coverage of the pandemic at [sciencemag.org/tags/coronavirus][1]. #### U.S. vaccine efficacy trials begin The first large-scale efficacy trials of COVID-19 vaccines in the United States began last week. On 27 July, the National Institutes of Health, working with Moderna, announced the start of one that aims to recruit 30,000 people. Later that day, a partnership between Pfizer and BioNTech announced separately it was launching a similarly sized study at sites in the United States and elsewhere. Both the Moderna and the Pfizer/BioNTech vaccines contain messenger RNA that prompts cells to make a protein that studs the surface of the COVID-19 virus. If the vaccines work, this viral protein will safely teach the immune system how to battle the virus if a person later is exposed to it. Operation Warp Speed, the Trump administration's push to accelerate development of a COVID-19 vaccine, has committed nearly $3 billion to these two R&D projects, about half its total investment. Other efficacy trials of various COVID-19 vaccines have begun in Brazil, the United Arab Emirates, and the United Kingdom. Results are expected in late fall at the earliest. #### CDC slammed over school rules Guidance issued by the U.S. Centers for Disease Control and Prevention (CDC) last week for safely reopening schools downplays risks that teachers, other staff members, and students will spread or contract COVID-19, many public health specialists say. Provoking claims that CDC's advice had been politicized, the agency revised an earlier draft that President Donald Trump had panned as “very tough and expensive.” The nonbinding recommendations, released 23 July, emphasize the social and developmental benefits of in-person schooling and highlight that young children are at low risk for contracting the disease and transmitting the virus that causes it. The document also recommends against screening students for symptoms. Nevertheless, the Trump administration did advise communities with high infection rates to consider not beginning in-person classes. Large school districts, such as those in Atlanta, Houston, Los Angeles, and San Diego, have already announced that they will begin the 2020–21 school year with online instruction only. #### Anti-Fauci TV segment canceled Following heavy criticism from scientists and others, Sinclair Broadcast Corp. this week canceled plans for its chain of local TV stations to air a segment featuring widely challenged accusations that Anthony Fauci, director of the National Institute of Allergy and Infectious Diseases, intentionally created the virus responsible for COVID-19 and sent it to China. Fauci has helped lead the U.S. effort to control the pandemic despite tangling with President Donald Trump. The allegation came from Judy Mikovits, a virologist and antivaccine activist who appears in a documentary about the coronavirus that was also widely debunked as false and misleading (). The Sinclair segment, a new interview with Mikovits, was available online until the company pulled it for review on 25 July, after Media Matters reported its existence. Sinclair announced on 27 July that it would not air the segment on the nearly 200 TV stations it owns or operates in 89 U.S. markets—but not before one in Charleston, West Virginia, had broadcast it. [1]: http://sciencemag.org/tags/coronavirus
- Asia > Japan (0.53)
- Europe > France (0.51)
- Asia > Middle East > UAE (0.44)
- (14 more...)
- Research Report > Strength High (0.87)
- Research Report > Experimental Study (0.87)
Watch the donut not the hole. Welcome to the New AI winter? - TrustNoRobot
"As you ramble through Life, Brother, Whatever be your goal. Keep your eye upon the doughnut, And not upon the hole." Advice for those who drink coffee and eat "sinkers." Artificial Intelligence is not new. Another thing that is not new is people over-promising and under-delivering results around AI.
- North America > United States > West Virginia > Kanawha County > Charleston (0.05)
- Europe > United Kingdom > England (0.05)
- Health & Medicine (1.00)
- Government > Regional Government (0.31)
Trump aide says president weighing regulations on Google search engine that he considers 'rigged'
President Donald Trump is talking about the Iowa college student that was found slain about a month after she disappeared, despite the victim's family asking that her death not be politicized. President Donald Trump speaks during a rally in Charleston, W.Va. Tuesday. WASHINGTON – White House economic adviser Larry Kudlow said Tuesday that President Donald Trump is considering new regulations on Google's search engine to address his concern that it turns up too many stories that are critical of him. Pressed by reporters at the White House on Tuesday about a tweet the president wrote criticizing Google's search engine as "rigged," the director of Trump's National Economic Council said the administration is "taking a look" at federal regulations for the company. "We'll let you know," he said.
- North America > United States > West Virginia > Kanawha County > Charleston (0.26)
- North America > United States > Iowa (0.26)
- North America > United States > Arizona (0.06)
- Law > Statutes (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
One Candidate's Plan to Resist Trump by Teaching Kids to Code
Alec Ross knows Trump country well. The former Obama administration staffer hails from the heart of coal country in Charleston, West Virginia. He grew up alongside the very people that President Trump likes to say Washington has left behind. As with Trump, Ross believes that government needs to do a better job lifting up these "forgotten men and women." Unlike Trump, Ross believes accomplishing that goal has little to do with sealing off the borders or reviving the coal industry at the expense of the world's climate.
- North America > United States > West Virginia > Kanawha County > Charleston (0.25)
- North America > United States > California (0.16)
- North America > United States > Maryland (0.09)
- North America > United States > Arkansas (0.06)
- Government > Regional Government > North America Government > United States Government (1.00)
- Education (1.00)
- Government > Voting & Elections (0.98)